NSF PAR Search | NSF Public Access Repository

Mastering Long-Tail Complexity on Graphs: Characterization, Learning, and Generalization

https://doi.org/10.1145/3637528.3671880

Wang, Haohui; Jing, Baoyu; Ding, Kaize; Zhu, Yada; Cheng, Wei; Zhang, Si; Fan, Yonghui; Zhang, Liqing; Zhou, Dawei (August 2024, ACM)

Full Text Available
Learning Node Abnormality with Weak Supervision

https://doi.org/10.1145/3583780.3614950

Zhou, Qinghai; Ding, Kaize; Liu, Huan; Tong, Hanghang (October 2023, ACM)

Full Text Available
Virtual Node Tuning for Few-shot Node Classification

Tan, Zhen; Ding, Kaize; Guo, Ruocheng; Liu, Huan (August 2023, ACM)

Full Text Available
STREAMS: Towards Spatio-Temporal Causal Discovery with Reinforcement Learning for Streamflow Rate Prediction

https://doi.org/10.1145/3583780.3614719

Sheth, Paras; Mosallanezhad, Ahmadreza; Ding, Kaize; Shah, Reepal; Sabo, John; Liu, Huan; Candan, K Selçuk (October 2023, Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (CIKM '23))
Federated Few-shot Learning

https://doi.org/10.1145/3580305.3599347

Wang, Song; Fu, Xingbo; Ding, Kaize; Chen, Chen; Chen, Huiyuan; Li, Jundong (August 2023, ACM)

Federated Learning (FL) enables multiple clients to collaboratively learn a machine learning model without exchanging their own local data. In this way, the server can exploit the computational power of all clients and train the model on a larger set of data samples among all clients. Although such a mechanism is proven to be effective in various fields, existing works generally assume that each client preserves sufficient data for training. In practice, however, certain clients can only contain a limited number of samples (i.e., few-shot samples). For example, the available photo data taken by a specific user with a new mobile device is relatively rare. In this scenario, existing FL efforts typically encounter a significant performance drop on these clients. Therefore, it is urgent to develop a few-shot model that can generalize to clients with limited data under the FL scenario. In this paper, we refer to this novel problem as federated few-shot learning. Nevertheless, the problem remains challenging for two major reasons: the global data variance among clients (i.e., the difference in data distributions among clients) and the local data insufficiency in each client (i.e., the lack of adequate local data for training). To overcome these two challenges, we propose a novel federated few-shot learning framework with two separately updated models and dedicated training strategies to reduce the adverse impact of global data variance and local data insufficiency. Extensive experiments on four prevalent datasets that cover news articles and images validate the effectiveness of our framework compared with the state-of-the-art baselines.
Full Text Available
Supervised Graph Contrastive Learning for Few-Shot Node Classification

https://doi.org/10.1007/978-3-031-26390-3_24

Tan, Zhen; Ding, Kaize; Guo, Ruocheng; Liu, Huan (March 2023, Springer)

Full Text Available
Few-shot Node Classification with Extremely Weak Supervision

https://doi.org/10.1145/3539597.3570435

Wang, Song; Dong, Yushun; Ding, Kaize; Chen, Chen; Li, Jundong (February 2023, Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining)

Few-shot node classification aims at classifying nodes with limited labeled nodes as references. Recent few-shot node classification methods typically learn from classes with abundant labeled nodes (i.e., meta-training classes) and then generalize to classes with limited labeled nodes (i.e., meta-test classes). Nevertheless, on real-world graphs, it is usually difficult to obtain abundant labeled nodes for many classes. In practice, each meta-training class can only consist of several labeled nodes, known as the extremely weak supervision problem. In few-shot node classification, with extremely limited labeled nodes for meta-training, the generalization gap between meta-training and meta-test will become larger and thus lead to suboptimal performance. To tackle this issue, we study a novel problem of few-shot node classification with extremely weak supervision and propose a principled framework X-FNC under the prevalent meta-learning framework. Specifically, our goal is to accumulate meta-knowledge across different meta-training tasks with extremely weak supervision and generalize such knowledge to meta-test tasks. To address the challenges resulting from extremely scarce labeled nodes, we propose two essential modules to obtain pseudo-labeled nodes as extra references and effectively learn from extremely limited supervision information. We further conduct extensive experiments on four node classification datasets with extremely weak supervision to validate the superiority of our framework compared to the state-of-the-art baselines.
Full Text Available
Data Augmentation for Deep Graph Learning: A Survey

https://doi.org/10.1145/3575637.3575646

Ding, Kaize; Xu, Zhe; Tong, Hanghang; Liu, Huan (November 2022, ACM SIGKDD Explorations Newsletter)

Graph neural networks, a powerful deep learning tool to model graph-structured data, have demonstrated remarkable performance on numerous graph learning tasks. To address the data noise and data scarcity issues in deep graph learning, the research on graph data augmentation has intensified lately. However, conventional data augmentation methods can hardly handle graph-structured data, which is defined in non-Euclidean space with multi-modality. In this survey, we formally formulate the problem of graph data augmentation and further review the representative techniques and their applications in different deep graph learning problems. Specifically, we first propose a taxonomy for graph data augmentation techniques and then provide a structured review by categorizing the related work based on the augmented information modalities. Moreover, we summarize the applications of graph data augmentation in two representative problems in data-centric deep graph learning: (1) reliable graph learning, which focuses on enhancing the utility of the input graph as well as the model capacity via graph data augmentation; and (2) low-resource graph learning, which aims to enlarge the scale of labeled training data through graph data augmentation. For each problem, we also provide a hierarchical problem taxonomy and review the existing literature related to graph data augmentation. Finally, we point out promising research directions and the challenges in future research.
Full Text Available
Generalized few-shot node classification on graphs

Xu, Zhe; Ding, Kaize; Wang, Yu-Xiong; Liu, Huan; Tong, Hanghang (November 2022, IEEE International Conference on Data Mining)

Full Text Available
Generalized Few-Shot Node Classification

https://doi.org/10.1109/ICDM54844.2022.00071

Xu, Zhe; Ding, Kaize; Wang, Yu-Xiong; Liu, Huan; Tong, Hanghang (November 2022, 2022 IEEE International Conference on Data Mining (ICDM))

For real-world graph data, the node class distribution is inherently imbalanced and long-tailed, which naturally leads to a few-shot learning scenario with limited nodes labeled for newly emerging classes. Existing efforts are carefully designed to solve such a few-shot learning problem via data augmentation and learning transferable initializations, to name a few. However, most, if not all, of them are based on a strong assumption that all the test nodes must exclusively come from novel classes, which is impractical in real-world applications. In this paper, we study a broader and more realistic problem named generalized few-shot node classification, where the test samples can be from both novel classes and base classes. Compared with standard few-shot node classification, this new problem imposes several unique challenges, including asymmetric classification and inconsistent preference. To counter those challenges, we propose a shot-aware graph neural network (STAGER) equipped with an uncertainty-based weight assigner module for adaptive propagation. To formulate this problem from the meta-learning perspective, we propose a new training paradigm named imbalanced episodic training to ensure the label distribution is consistent between the training and test scenarios. Experimental results on four real-world datasets demonstrate the efficacy of our model, with up to 14% accuracy improvement over baselines.
Full Text Available
